The statistical significance of the MUC-4 results
نویسنده
چکیده
The MUC-4 scores of recall, precision, and the F-measures are used to measure the performance of the participating systems. The differences in the scores between any two systems may be due to chance or may be due to a significant difference between the two systems. To rule out the possibility that the difference is due to chance, statistical hypothesis testing is used. The method of hypothesis testing used is a computationally-intensive method known as approximate randomization. The method and the statistical significance of the results for the two MUC-4 test sets, TST3 and TST4, will be discussed in this paper.
منابع مشابه
The statistical significance of the MUC-5 results
The statistical significance of the results of the MUC-5 evaluation is determined using a computer-intensiv e method of hypothesis testing known as approximate randomization . The exact method is described in detail in 111 an d [2] and has been used as the accepted statistical test for the MUC results since MUC-3 . The purpose of the statistica l testing is to determine whether the scores of th...
متن کاملStatistical significance of MUC-6 results
The results of the MUC-6 evaluation must be analyzed to determine whether close scores significantl y distinguish systems or whether the differences in those scores are a matter of chance. In order to do such an analysis , a method of computer intensive hypothesis testing was developed by SAIC for the MUC-3 results and has been use d for distinguishing MUC scores since that time . The implement...
متن کاملEvaluating Message Understanding Systems: An Analysis of the Third Message Understanding Conference (MUC-3)
This paper describes and analyzes the results of the Third Message Understanding Conference (MUC-3). It reviews the purpose, history, and methodology of the conference, summarizes the participating systems, discusses issues of measuring system effectiveness, describes the linguistic phenomena tests, and provides a critical look at the evaluation in terms of the lessons learned. One of the commo...
متن کاملClinicopathological and prognostic significance of MUC-2, MUC-4 and MUC-5AC expression in japanese gastric carcinomas.
BACKGROUND The mucin components of the gastric gel layer function as a protective and lubricating factor against luminal acid and proteolytic enzymes. Alteration of mucin expression in gastric preneoplastic and neoplastic lesions has suggested potential roles in neoplastic processes. This study aimed to assess the clinicopathological and prognostic significance of MUC-2, MUC-4 and MUC-5AC in Ja...
متن کاملStatistical and Practical Significance of Articles at Sports Biomechanics Conferences
Background. The importance of using statistical approaches has increased and became necessary for researchers and specialists in sports biomechanics because they need more objective and accurate methods to increase knowledge. Objectives. Evaluate the reality of using practical significance in the articles published in scientific conferences in the biomechanical sport. Methods. One hundred twe...
متن کامل